Automatic audio-video switching

ABSTRACT

A technique is disclosed for automatic switching of a mobile device between a videoconference operating mode and a teleconference operating mode, where the automatic switching is based on a detected functional use and current position of the device in relation to a user of the device. In at least some embodiments, the disclosed technology includes a video-audio switching system that works in coordination with a sensor mechanism associated with the device to enable the automatic switching. The technology disclosed herein enables the user to carry out a conversation via the device (i.e., conference session) without interruption when transitioning between different settings, (e.g., using the device while sitting at an office to using it while on-the-go to an appointment).

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation of U.S. patent application Ser. No.14/618,716, filed Feb. 10, 2015, entitled “AUTOMATIC AUDIO-VIDEOSWITCHING”, which claims the benefit of U.S. Provisional PatentApplication No. 61/937,979 filed Feb. 10, 2014, entitled “AUTOMATICAUDIO-VIDEO SWITCHING”, all of which are expressly incorporated byreference herein.

BACKGROUND

Mobile devices, such as smartphones and personal digital assistants(PDAs), are versatile computing machines that provide users the abilityto communicate with one another. The devices' mobile capability allowsusers to communicate in settings beyond those of traditional, land linetelephones, i.e., on-the-go, in any setting. For example, a user canmake a teleconference call while walking using a mobile device. The sameuser can later change to a videoconference call using the same mobiledevice.

For each type of call, however, the user is required to interactmanually with the device in order to carry out the operation (e.g.,punch in keypresses to activate video camera for a videoconference).Such manual interaction is disadvantageous as it requires the user tolook at the device; this is cumbersome when the user is transitioningbetween different settings during a conversation (e.g., driving,walking, multitasking, etc.). Another disadvantage is that a time lagwill likely result in order for actual execution of the appropriateoperation to occur, preventing the user from having a seamlessconversation via the device.

BRIEF DESCRIPTION OF THE DRAWINGS

One or more embodiments of the present invention are illustrated by wayof example and are not limited by the figures of the accompanyingdrawings, in which like references indicate similar elements.

FIG. 1 illustrates an environment in which various embodiments of theaudio-video switching technology can be implemented.

FIG. 2 illustrates conceptually the audio-video switching technologybeing utilized in association with a smartphone.

FIG. 3 illustrates a first embodiment of a process for managing audioand video modes associated with a mobile device during avideoconference.

FIG. 4 illustrates a second embodiment of a process for managing audioand video modes associated with a mobile device during avideoconference.

FIG. 5 illustrates a third embodiment of a process for managing audioand video modes associated with a mobile device during avideoconference.

FIG. 6 illustrates a computer system that can be utilized to performvarious techniques disclosed herein.

DETAILED DESCRIPTION

References in this description to an embodiment“, one embodiment”, orthe like, mean that the particular feature, function, structure orcharacteristic being described is included in at least one embodiment ofthe present invention. Occurrences of such phrases in this specificationdo not necessarily all refer to the same embodiment. On the other hand,such references are not necessarily mutually exclusive.

General Description

Introduced herein is a technology for automatic switching of a mobiledevice between a videoconference operating mode and a teleconferenceoperating mode, where the automatic switching is based on a detectedfunctional use and current position of the device in relation to a userof the device. The technology disclosed herein enables the user to carryout a conversation via the device without interruption whentransitioning between different settings (where each setting may placethe device in a different position unsuitable for a current operatingmode).

The term “setting,” as used herein, refers to a context, or anenvironment, under which the user utilizes the mobile device. Forexample, the setting can be the device being held at a distance awayfrom the user's ear. In such setting, the user is likely using thedevice, for example, to conduct a videoconference session while sittingin an office. In another example, the setting can be the device beingused inside a pocket or a bag of the user, where a hands-free headset ora speakerphone of the device is enabled to permit communication via thedevice. In this setting, the user is likely using the device, forexample, to conduct a teleconference session while on-the-go (e.g.,transitioning from the office to another meeting or appointment).

In at least some embodiments, the disclosed technology includes avideo-audio switching system that facilitates the automatic switching ofa mobile device's operating mode to enable communication via the devicewithout interruption. A mobile device often includes one or moreoperating modes for which the user can activate to accommodate thesetting in which the user utilizes the device. For example, avideoconference-enabled mode activates the device's camera function toenable transmission of video and audio from the user to other users in avideoconference session. In another example, a teleconference-enabledmode activates the device's speaker function to receive audio in ateleconference session.

In at least some embodiments, the video-audio switching system works incoordination with a sensor mechanism associated with the device toenable the automatic switching. The sensor mechanism is configured todetect and transmit a signal indicative of a current position of thedevice in relation to the user. The detection mechanism can include oneor more sensors configured to detect various attributes associated withthe current position. The sensor mechanism can include a proximitysensor that detects the position, or proximity, of the mobile device inrelation to one or more specified spatial areas associated with theuser's body. The various attributes can include, for example, aproximity measurement, such as a proximate distance between the deviceand a user of the device (e.g., x-distance proximate to the user's orthe user's ear) or a proximate location of the device in relation to theuser (e.g., x, y, z coordinates of a three-dimensional space). Thevarious attributes can also include an acceleration measurement (e.g.,acceleration associated with movement of the device) or an orientationmeasurement (e.g., an upright orientation, a sideway orientation, upsidedown orientation).

In at least some embodiments, the video-audio switching system, inresponse to the current position detected by the sensor mechanism,activates a particular operating mode that corresponds to the currentposition. That is, the user does not need to manually execute a keysequence (e.g., keypresses) in order to have the device operate underthe appropriate mode. In at least some embodiments, the video-audioswitching system analyzes the device's position as an indication of aparticular setting, or context, in which the user is utilizing thedevice. For example, when the device is detected as being sideway andaway from the user's ear (e.g., in user's pocket), the video-audioswitching system correlates the position to an indication that thedevice is likely being used for a teleconference session.

In at least some embodiments, the video-audio switching systemautomatically switches the mobile device between the different operatingmodes seamlessly in response to the detected current position. Uponreceiving a signal from the sensor mechanism, the video-audio switchingsystem determines whether a current functional use of the device, i.e.,the current operating mode, corresponds to the position detected by thesensor mechanism. For example, if the device is operating undervideoconference mode, but the current position indicates that the deviceis oriented proximate to the user's ear, the video-audio switchingsystem automatically switches the device to teleconference mode. In suchexample, the video-audio switching system analyzes the detected positionto be unfit for use in a videoconference mode, and causes the device toswitch to teleconference mode. Referring to the same example, if thevideo-audio switching system receives from the sensor mechanism a newupdated signal indicating the device is oriented in a position fit for avideoconference, the system will automatically switch the device back tovideoconference mode.

In at least some embodiments, the automatic switching of the mobiledevice to teleconference mode includes rerouting the sound transmissionof the conference session. The rerouting includes directing the soundtransmission from the device's speakerphone to the speaker at the user'sear to correspond with the current position detected. In someembodiments, the sound transmission is rerouted from the speakerphone toa hands-free headset connected to the device.

In at least some embodiments, the automatic switching of the mobiledevice to teleconference mode includes deactivating unnecessaryvideoconference related capabilities of the device. For example, thecamera capability is deactivated, or turned off, when teleconferencemode is enabled.

In at least some embodiments, the automatic switching of the mobiledevice to teleconference mode includes generating an avatar of the user.The generated avatar replaces a video image of the user in the previousvideoconference session. One advantage of having the avatar is beingable to provide a more natural transition for the user in the videochat. For example, instead of a missing video live stream of the user'sface, the avatar is displayed to the other users of the videoconferencewhen the user switches to teleconference mode. In some embodiments, theavatar can emulate facial expressions of the user speaking, wherein thefacial expressions correspond to the audio being transmitted to theother users in the videoconference.

In at least some embodiments, the automatic switching of the mobiledevice to videoconference mode includes rerouting the sound transmissionof the conference session. The rerouting includes directing the soundtransmission from the device's ear speaker to the speakerphone tocorrespond to the current position detected. In some embodiments, thesound transmission is rerouted from a hands-free headset connected tothe device to the speakerphone.

In at least some embodiments, the automatic switching of the mobiledevice to videoconference mode includes activating certainvideoconference related capabilities of the device. For example, thecamera capability is activated, or turned on, when videoconference modeis enabled.

The technology disclosed herein provide many advantages. One advantageis automatic mode switching for the mobile device without the userhaving to manually press a button or a sequence of keys to transitionbetween different settings. Another advantage is preservation ofresources associated with the mobile device. For example, when inteleconference mode, the video-audio switching system deactivates thecamera and the speakerphone of the device as they are no longer in use;such deactivation saves battery life and prolongs the life of the devicecomponents. Another advantage of the disclosed technology is a seamlesstransition presented to a user when the mobile device is transitioningbetween teleconference and videoconference modes, such that the user'scommunication session is uninterrupted. For example, when switching fromvideoconference to teleconference mode, the user's video camera feed isreplaced automatically with an avatar to allow a natural progression ofthe videoconference to continue.

Other aspects and advantages of the disclosed technology will becomeapparent from the following description in combination with theaccompanying drawings, illustrating, by way of example, the principlesof the disclosed technology.

FIG. 1 is a block diagram of an environment 100 in which variousembodiments of the video-audio switching technology can be implemented.Although the diagram depicts components as functionally separate, suchdepiction is merely for illustrative purposes. It will be apparent toone of ordinary skill in the art that the components portrayed in thisfigure can be arbitrarily combined or divided into separate software,firmware, and/or hardware components. Furthermore, it will also beapparent to one of ordinary skill in the art that such components,regardless of how they are combined or divided, can execute on the samehost or multiple hosts, and wherein multiple hosts can be connected byone or more networks.

It is noted that while the discussion focuses on operating modescorresponding to videoconference and/or teleconference sessions, otheroperating modes associated with the device may also benefit from thetechnology disclosed herein; that is, the technology is applicable toenable automatic switching between various operating modes of the devicebased on detection of the mobile device's position (e.g., proximity,orientation, etc.).

The environment 100 includes a conference system 110 that facilitatesone or more conference sessions (i.e., videoconferences and/orteleconferences) taking place between two or more users, such as betweena first user using a first client device 120 and other users (e.g.,second user, third user, etc.) using respective client devices 140A-N(e.g., second device, third device, etc.). For example, the first user,via the first client device 120, can send a request to the conferencesystem 110 to initiate a videoconference session, or “video chat,” withone or more other users. In response, the conference system 110, canfacilitate the video chat by connecting the client device 120 torespective client devices 140A-N of the one or more users.

The communication between the client devices 120, 140A-N and theconference system 110 can occur over a communication network 102. Thecommunication network 102 can be of various types, including a localarea network, a wide area network, the Internet, or a public switchedtelephone network.

The client devices 120, 140A-N can be any general-purpose devices withdata processing and data communication capabilities. For example, thefirst client device 120 can be a mobile phone, a tablet, an e-reader, orany other mobile or portable computing devices. The first client device120 can include, for example, two-way communication devices withadvanced data communication capabilities. The client device 120 can havethe capability to allow voice and video conversations (e.g., video chator audio chat) with other voice/video capable devices, such as aninternet telephone, another client device 140A, or the like. Each of theclient devices 140A-N can be a mobile computing device, similar to thefirst client device 120, or any other non-mobile computing device, suchas a desktop computer, with data processing and data communicationcapabilities.

The first client device 120 can include a processor 122, a detectionmechanism 124, a camera 128, a display 130, one or more speaker(s) 132,and a microphone 134. The display 130 can include a display screen, suchas a liquid crystal display (LCD) or an organic light emitting diodedisplay (OLED). The one or more speaker(s) 132 can include aspeakerphone, an ear speaker, or a speaker module for relaying audio toan external speaker. The external speaker can be, for example, an earbud of a hands-free headset.

The detection mechanism 124 can be any combination of software and/orhardware. In some embodiments, the detection mechanism 124 works incoordination with the processor 122 to execute instructions receivedfrom the conference system 110. The detection mechanism 124 can includeone or more sensors 126 configured to detect an operating position ofthe client device 120. The sensors 126 can include a proximity sensorconfigured to detect a proximate distance of the client device 120 inrelation to an operator, or user, of the device. While a proximitysensor is sufficient to detect the device's position, other sensors canbe used in combination with the proximity sensor in other embodiments.The other types of sensors can include, for example, a gyroscope, anaccelerometer, and the like.

In some embodiments, the detection mechanism 124 can generate andtransmit to the video-audio switching system 112 a signal indicative ofa position of the device using the sensors 126. In one example, thesignal can include an orientation measurement of the device relative tothe user. The orientation measurement can be, for example, upside-down,right side up, sideways, or any other orientation. In another example,the signal can include a proximity of the device relative to a user ofthe device. The proximity can include, for example, a quantification ofa distance between the device and the user. In some embodiments, theproximity sensor can be configured to incorporate a time delay indetecting the proximity to prevent “false alarm” detection. The timedelay is useful, for example, when a user's finger temporarily passesover the proximity sensor. In such instance, the proximity sensor waitsfor a predetermined time (e.g., 700 milliseconds)—ensuring thatproximity persists in the predetermined time—before concluding thedevice is within proximate contact with a particular user (or object,body mass, etc.). The delay mechanism can be incorporated into theprocessor 122 instead.

In some embodiments, the conference system 110 includes a video-audioswitching system 112 (or “switching system”) configured to facilitateautomatic switching of the operating modes based on the signal from thedetection mechanism 124. In some embodiments, the switching system 112is an external system that works in coordination with the conferencesystem 110 to facilitate the automatic switching capability. Theswitching system 112 causes the first client device 120 to alternate, orswitch, between a videoconference operating mode (or, “video mode”) anda teleconference operating mode (or, “audio mode”) in response to thedevice 120 transitioning between different settings during a conferencesession (or “conference call”). As discussed above, a setting refers toan environment or context in which the device is being used in relationto the user.

In some embodiments, the switching system 112 works in coordination withthe processor 122 and the detection mechanism 124 of the client device120 to facilitate the automatic switching. The switching system 112first determines a particular setting of the device 120 based on asignal indicative of the device's position, or orientation, in relationto the user using the device. For example, where the client device 120is identified as being in a position away from the user, the switchingsystem 112 correlates such position to a setting associated with thedevice being used for a videoconference session. Such correlation isbased on, for example, a logic that the device's position allows thedevice's camera to capture real-time video footage of the user (e.g.,device being away from user's face). The logic may be configured by anoperator of the switching system 112.

In response to the detected current position, the switching system 112determines a current functional use of the device, and determineswhether such use corresponds to the current position. If the currentfunctional use corresponds to the current position, then the switchingsystem 112 takes no action. If the current functional use does notcorrespond to the current position, then the switching system 112 workswith the processor 122 to cause the operating mode of the device 120 toalternate, or switch, to an operating mode that places the functionaluse of the device 120 in accordance with the current position.

FIG. 2 illustrates the audio-video switching technology being utilizedin association with a mobile device 200, such as an iPhone®, accordingto some embodiments. The mobile device 200 includes a camera 202, an earspeaker 204, a display 206, a speakerphone 208, and a microphone 210. AUser X 212 can utilize the mobile device 200 to conduct a conferencesession with one or more other users, e.g., User Y and User Z (notshown).

As discussed above, the mobile device 200 can work in coordination witha video-audio switching system to facilitate smooth operation of thedevice 200 when transitioning between different settings during aconference session (i.e., where the device assumes different positionswhile participating in the session). The conference session can be, forexample, a videoconference session or a teleconference session.

For illustrative purposes, consider scenarios 220, 230, and 240 to bedemonstrative of different settings between which User X can transitionwhile participating in a conference session with Users Y and Z. Duringeach scenario, the video-audio switching system can detect a currentposition and a current operating mode of the mobile device 200, andcauses, in response to such detection, the mobile device to switch to anappropriate operating mode to provide a seamless transition for the userduring the conference session.

In scenario 220, consider an example where User X 212 initiates videomode in the mobile device 200 to start a videoconference session (or“video chat”) with Users Y and Z. To carry out the video chat, the UserX places the phone at a position 222, away from the user's face, suchthat the camera 202 of the device 200 is able to detect, record, andtransmit a live image of the user for the video chat. In scenario 220,the video-audio switching system detects such position 222 and thecurrent operating mode of the mobile device 200. In some embodiments,the video-audio switching system detects the position 222 based on asignal received from a detection mechanism (e.g., detection mechanism124 of FIG. 1) coupled to the video-audio switching system. Based on thedetected position 222 (i.e., away from User X) and the detected currentoperating mode (i.e., video mode), the video-audio switching systemdetermines whether the video mode corresponds to the detected position222.

The determination whether the operating mode corresponds to the detectedposition can be based on a configurable logic. In some embodiments, thelogic can be changed, or updated, by an operator of the video-audioswitching system. The logic can be, for example, if the position isx-distance from the user, where the x-distance exceeds a threshold(e.g., 3 feet), then video mode is the appropriate operating mode (e.g.,initiate video mode). In yet another example, the logic can be if theposition is x-distance from the user and oriented in y-orientation, thenvideo mode is the appropriate operating mode. The logic can also bebased on a predetermined threshold. For example, the logic can include adetermination that if the x-distance, between the device and the user,exceeds a threshold (e.g., 3 feet), then video mode is the appropriateoperating mode (e.g., initiate video mode); otherwise, the audio mode isthe appropriate operating mode (where the y-distance from the floor canbe as high as the ear level or as low as the pocket level). In anotherexample, the logic can include a determination that if the orientationexceeds a certain axis by z-degrees, then an audio mode is theappropriate operating mode. Other variations and/or combinations ofthese features can be readily apparent to one of ordinary skill in theart based on the disclosure here.

Following the logic, the video-audio switching system determines inscenario 220 that position 222 corresponds to the current video mode.The video-audio switching system maintains the video mode and does notinterfere with the mobile device's functioning operations.

Consider another example for scenario 220, where User X starts outinitiating an audio mode to start a teleconference, or voice chat(instead of a video chat) with Users Y and Z. In such example, thevideo-audio switching system detects the position 222 and the currentoperating mode (i.e., audio mode), determines that the operating modedoes not correspond with the position 222 (i.e., corresponds with videomode), and causes, in response to such determination, the mobile device200 to switch to video mode. This switching can include, for example,rerouting the audio transmission from the small ear speaker 204 to thespeakerphone 208 and activating the camera 202.

In scenario 230, consider an example where User X 212 starts out invideo mode, continues the video chat with Users Y and Z, and transitionsinto a different setting by placing the mobile device 200 in position232, next to the user's ear. Similar to scenario 220, the video-audioswitching system determines whether the current operating mode of thedevice 200 corresponds to the position 232. In such position 232, thevideo-audio switching system determines that the camera 202 of thedevice 200 is effectively blocked, unable to detect, record, or transmita live image of User X. This determination can be based on various logicthat is configurable, for example, by an operator of the video-audioswitching system.

In the position 232, the video-audio switching system determines thatthe video mode, which has been initially started by User X, is no longeran appropriate operating mode. In response, the video-audio switchingsystem causes the mobile device 200 to switch to an audio mode. Theswitching can include, for example, rerouting the audio transmissionfrom the speakerphone 208 to the small ear speaker 204, deactivating thecamera 202, and/or generating an avatar for transmission and display tothe Users Y and Z in the conference session. In some embodiments, theavatar can indicate that User X has the phone by his or her ear orotherwise is only listening to the audio without watching any video. Insome embodiments, the avatar can emulate facial expressions of User Xbased on the audio received via the microphone 210, including a tone ofthe voice or the speech contained in the voice, to show what User Xmight be feeling or speaking.

In scenario 240, consider an example where User X transitions into yetanother setting by moving the mobile device 200 from position 232 toposition 242, away from the user's ear (i.e., into User X's pocket).Similar to scenarios 220, 230, the video-audio switching systemdetermines whether the current operating mode corresponds with thedetected position 242. In the position 242, the video-audio switchingsystem determines that the camera 202 of the device 200 is effectivelyblocked, unable to detect, record, or transmit a live image of User X,determines that the audio mode is the appropriate operating mode, andmaintains the audio mode for the device 200. This determination can bebased on various logic configurable, for example, by an operator of thevideo-audio switching system. The logic can include, for example, if thedevice 200 is at an x-distance away from User X, where the x-distance iswithin a threshold (e.g., three inches), then initiate audio mode. Inthe example of scenario 240, the video-audio switching system causes thedevice to maintain the audio mode (from previous position 232) as suchmode corresponds to the position 242.

FIG. 3 illustrates a first embodiment of a process 300 for managingaudio and video modes associated with a user device during a conferencesession. The process 300 can be executed by a video-audio switchingsystem of a conference system (e.g., the conference system 110 of FIG.1), or a video-audio switching system that is external to the conferencesystem and works in coordination with the conference system and the userdevice, or a video-audio switching mobile application installed on theuser device.

At step 302, the video-audio switching system (or “VAS system”) detectsa conference call is operating on a mobile device. In particular, theVAS system determines the type of conference call currently taking placeon the device. The type of conference call can be, for example, avideoconference call or a teleconference call. Determination of anactive conference call causes the VAS system to start monitoring thecurrent position of the device and the current operation mode of thedevice, as indicated in step 304.

At step 306, the VAS system determines a current position associatedwith the conference call operating on the mobile device. The VAS systemcan determine the current position by, for example, receiving a signalindicative of the position from a detection system associated with thedevice (e.g., detection mechanism 124 of FIG. 1).

At step 308, the VAS system determines a current operation mode of thedevice. The operation mode can include, for example, a video mode (i.e.,an operation mode activating videoconference-related functionalcomponents of the device) or an audio mode (i.e., an operation modeactivating teleconference-related functional components of the device).

At step 310, the VAS system determines whether the current operationmode corresponds to the current position. At step 312, in response todetermining that the current operation mode does not correspond (i.e.,not the appropriate mode), the VAS system causes the mobile device toswitch mode to an appropriate operation mode. If the current operationmode corresponds to the current position, the VAS system continuesmonitoring the current position and the current operation mode, asindicated in step 304.

FIG. 4 illustrates a second embodiment of a process 400 for managingaudio and video modes associated with a user device during avideoconference. The process 400 can be executed by a video-audioswitching system of a conference system (e.g., the conference system 110of FIG. 1), or a video-audio switching system that is external to theconference system and works in coordination with the conference systemand the user device, or a video-audio switching mobile applicationinstalled on the user device.

At step 402, the video-audio switching system (“VAS system”) detects acurrent position indicative of the mobile device operating to enableparticipation in a teleconference. The current position can be based ona signal from a detection system (e.g., detection mechanism 124 of FIG.1). The current position can include, for example, the device beingwithin a proximately close distance to a user of the device, such asbeing next to the user's ear. The VAS system determines such position isindicative of the device being used for a voice-related call (e.g., ateleconference session). As discussed above, such determination can bebased on a configurable logic associated with the VAS system. Forexample, the logic can include setting a position proximately close tothe user to indicate that the device is currently being used for ateleconference. At step 404, the VAS system causes the device to switchto an audio mode to correspond to the device being in teleconferenceoperation.

FIG. 5 illustrates a third embodiment of a process 500 for managingaudio and video modes associated with a user device during avideoconference. The process 500 can be executed by a video-audioswitching system of a conference system (e.g., the conference system 110of FIG. 1), or a video-audio switching system that is external to theconference system and works in coordination with the conference systemand the user device, or a video-audio switching mobile applicationinstalled on the user device.

At step 502, the video-audio switching system (“VAS system”) detects acurrent position indicative of the mobile device operating to enableparticipation in a videoconference. The current position can be based ona signal from a detection system (e.g., detection mechanism 124 of FIG.1). The current position can include, for example, the device being at adistance away from a user of the device, such as being away from theuser's ear. The VAS system determines such position is indicative of thedevice being used for a video-related call (e.g., a videoconferencesession). As discussed above, such determination can be based on aconfigurable logic associated with the VAS system. For example, thelogic can include setting a position that is away from the user (above apredetermined threshold) to indicate that the device is currently beingused for a videoconference. At step 504, the VAS system causes thedevice to switch to a video mode to correspond to the device being invideoconference operation.

Regarding the processes 300, 400, and 500, while the various steps,blocks, or sub-processes are presented in a given order, alternativeembodiments can perform routines having steps, or employ systems havingsteps, blocks or sub-processes, in a different order. Further, somesteps, sub-processes or blocks can be deleted, moved, added, subdivided,combined, and/or modified to provide alternatives or sub-combinations.Each of these steps, blocks or sub-processes can be implemented in avariety of different ways. Also, while the steps, sub-processes orblocks are at times shown as being performed in series, some steps,sub-processes, or blocks can be performed, instead, in parallel or atdifferent times, as will be readily recognized by a person of ordinaryskill in the art in light of the disclosure herein. Additionally, anyspecific numbers noted herein are only by example as alternativeimplementations can employ differing values or ranges.

FIG. 6 illustrates a computer system 600 that can be utilized to performvarious techniques disclosed herein. The computer system 600 includes aprocessor 602, a memory 604, non-volatile memory 606, a networkinterface 608, input/output (I/O) devices 610, and a drive unit 612.Various common components (e.g., cache memory) are omitted forillustrative simplicity.

The computer system 600 is intended to illustrate a hardware device onwhich any of the components depicted in FIGS. 1-5 (and any othercomponents described in this specification) can be implemented. Forexample, the components can include the conference system 110 of FIG. 1,the video-audio switching system 112 of FIG. 1, or the user devices 120,140A-N. The computer system 600 can be of any applicable known orconvenient type. The components of the computer system 600 can becoupled together via a bus 620 or through some other known or convenientdevice.

The computer system 600 can be embodied in any suitable physical form.For example, the computer system 600 can be an embedded computer system,a system-on-chip (SOC), a single-board computer system (SBC) (such as,for example, a computer-on-module (COM) or system-on-module (SOM)), adesktop computer system, a laptop or notebook computer system, aninteractive kiosk, a mainframe, a mesh of computer systems, a mobiletelephone, a personal digital assistant (PDA), a server, or acombination of two or more of these. Where appropriate, computer system600 can include one or more computer systems 600; be unitary ordistributed; span multiple locations; span multiple machines; or residein a cloud, which can include one or more cloud components in one ormore networks. Where appropriate, one or more computer systems 600 canperform without substantial spatial or temporal limitation one or moresteps of one or more methods described or illustrated herein. Forexample, one or more computer systems 600 can perform in real time or inbatch mode one or more steps of one or more methods described orillustrated herein. The one or more computer systems 600 can perform atdifferent times or at different locations one or more steps of one ormore methods described or illustrated herein, where appropriate.

The processor 602 can be a conventional microprocessor, such as an IntelCore microprocessor, an Intel Itanium microprocessor, a Motorola powerPC microprocessor, or a SPARC architecture processor. One of skill inthe relevant art will recognize that the terms “machine-readable(storage) medium” or “computer-readable (storage) medium” include anytype of device that is accessible by the processor.

The memory 604 is coupled to the processor 602 by, for example, a bus.The memory 604 can include, for example, random access memory (RAM),such as dynamic RAM (DRAM) or static RAM (SRAM). The memory can belocal, remote, or distributed.

The bus 620 also couples the processor 602 to the non-volatile memory606 and the drive unit 612. The non-volatile memory 606 can be, forexample, a magnetic floppy or hard disk, a magnetic-optical disk, anoptical disk, a flash memory such as NAND flash memory or NOR flashmemory, a read-only memory (ROM) such as a CD-ROM, a programmableread-only memory such as EPROM or EEPROM, a magnetic or optical card, oranother form of storage for large amounts of data. Some of this data isoften written, by a direct memory access process, into memory duringexecution of software in the computer 600. The non-volatile storage canbe local, remote, or distributed. The non-volatile memory is optionalbecause systems can be created with all applicable data available inmemory. A typical computer system will usually include at least aprocessor, memory, and a device (e.g., a bus) coupling the memory to theprocessor.

Software is typically stored in the non-volatile memory and/or the driveunit. Indeed, for large programs, storing the entire program in memorycan not even be possible. Nevertheless, one should understand that forsoftware to run, if necessary, the software is moved to a computerreadable location appropriate for processing, and for illustrativepurposes, that location is referred to as the memory in this disclosure.Even when software is moved to the memory for execution, the processorwill typically make use of hardware registers to store values associatedwith the software, and local cache that, ideally, serves to speed upexecution. As used herein, a software program is assumed to be stored atany known or convenient location (from non-volatile storage to hardwareregisters) when the software program is referred to as “implemented in acomputer-readable medium.” A processor is considered to be “configuredto execute a program” when at least one value associated with theprogram is stored in a register readable by the processor.

The bus 620 also couples the processor 602 to the network interfacedevice 608. The interface 608 can include one or more of a modem ornetwork interface. A person of ordinary skill will appreciate that amodem or network interface can be considered to be part of the computersystem 600. The interface 608 can include an analog modem, ISDN modem,cable modem, token ring interface, satellite transmission interface(e.g., “direct PC”), Wi-Fi interface, or other interfaces for coupling acomputer system to other computer systems. The interface can include oneor more input and/or output devices.

The I/O devices 610 can include, for example, a keyboard, a mouse orother pointing device, disk drives, printers, a scanner, and other inputand/or output devices, including a display device. The display devicecan include, for example, a cathode ray tube (CRT), liquid crystaldisplay (LCD), or some other applicable known or convenient displaydevice. For simplicity, this disclosure assumes that controllers of anydevices not depicted in the example of FIG. 6 reside in the interface.

The computer system can have one Bus or multiple Buses. The bus 620 caninclude, for example, a system bus, a Peripheral Component Interconnect(PCI) bus or PCI-Express bus, a HyperTransport or industry standardarchitecture (ISA) bus, a small computer system interface (SCSI) bus, auniversal serial bus (USB, USB 2.0, USB 3.0), IIC (I2C) bus, anInstitute of Electrical and Electronics Engineers (IEEE) standard 1394bus, also called “Firewire,” a QuickPath Interconnect bus, a ThunderBoltinterconnect bus, a DisplayPort interconnect bus or its companionstandards Mini DisplayPort (mDP), Direct Drive Monitor (DDM), EmbeddedDisplayPort (eDP), Internal DisplayPort (iDP), Portable Digital MediaInterface (PDMI), Wireless DisplayPort (wDP), and Mobility DisplayPort(MyDP), an HDMI interconnect bus, a DVI bus.

In operation, the computer system 600 can be controlled by operatingsystem software that includes a file management system, such as a diskoperating system. One example of operating system software withassociated file management system software is the family of operatingsystems known as Windows® from Microsoft Corporation of Redmond, Wash.,and their associated file management systems. Another example ofoperating system software with its associated file management systemsoftware is the Linux™ operating system and its associated filemanagement system. The file management system is typically stored in thenon-volatile memory and/or drive unit and causes the processor toexecute the various acts required by the operating system to input andoutput data and to store data in the memory, including storing files onthe non-volatile memory and/or drive unit.

SUMMARY

The foregoing description of the embodiments of the invention has beenpresented for the purpose of illustration; it is not intended to beexhaustive or to limit the invention to the precise forms disclosed.Persons skilled in the relevant art can appreciate that manymodifications and variations are possible in light of the abovedisclosure.

Some portions of this description describe the embodiments of theinvention in terms of algorithms and symbolic representations ofoperations on information. These algorithmic descriptions andrepresentations are commonly used by those skilled in the dataprocessing arts to convey the substance of their work effectively toothers skilled in the art. These operations, while describedfunctionally, computationally, or logically, are understood to beimplemented by computer programs or equivalent electrical circuits,microcode, or the like. Furthermore, it has also proven convenient attimes, to refer to these arrangements of operations as modules, withoutloss of generality. The described operations and their associatedmodules can be embodied in software, firmware, hardware, or anycombinations thereof.

Any of the steps, operations, or processes described herein can beperformed or implemented with one or more hardware or software modules,alone or in combination with other devices. In one embodiment, asoftware module is implemented with a computer program productcomprising a computer-readable medium containing computer program code,which can be executed by a computer processor for performing any or allof the steps, operations, or processes described.

Embodiments of the invention can also relate to an apparatus forperforming the operations herein. This apparatus can be speciallyconstructed for the required purposes, and/or it can comprise ageneral-purpose computing device selectively activated or reconfiguredby a computer program stored in the computer. Such a computer programcan be stored in a non-transitory, tangible computer readable storagemedium, or any type of media suitable for storing electronicinstructions, which can be coupled to a computer system bus.Furthermore, any computing systems referred to in the specification caninclude a single processor or can be architectures employing multipleprocessor designs for increased computing capability.

Embodiments of the invention can also relate to a product that isproduced by a computing process described herein. Such a product cancomprise information resulting from a computing process, where theinformation is stored on a non-transitory, tangible computer readablestorage medium and can include any embodiment of a computer programproduct or other data combination described herein.

Finally, the language used in the specification has been principallyselected for readability and instructional purposes, and it can not havebeen selected to delineate or circumscribe the inventive subject matter.It is therefore intended that the scope of the invention be limited notby this detailed description, but rather by any claims that issue on anapplication based hereon. Accordingly, the disclosure of the embodimentsof the invention is intended to be illustrative, but not limiting, ofthe scope of the invention, which is set forth in the following claims.

What is claimed is:
 1. A method of alternating between audio and videomodes during a videoconference, comprising: at a mobile devicecomprising a display, a camera, and a proximity sensor: detecting avideoconference call initialized on the mobile device; initializing, inresponse to detecting the videoconference call, a video mode to placethe mobile device in videoconference operation; detecting, via theproximity sensor, a device orientation being indicative of an audio modeoperating orientation; and switching, in response to the audio modeoperating orientation detected, the mobile device from the video mode tothe audio mode to place the mobile device in teleconference operation.2. The method of claim 1, wherein detecting, via the proximity sensor,the device orientation comprises detecting a proximity of the mobiledevice relative to an ear of a user operating the mobile device.
 3. Themethod of claim 1, wherein switching the mobile device from the videomode to the audio mode comprises: disabling the camera associated withthe mobile device; and generating an avatar on the display associatedwith the mobile device, the avatar being representative of a user usingthe mobile device in the teleconference operation, wherein the avatarreplaces a camera input of the camera.
 4. The method of claim 3, whereinthe avatar is configured to emulate facial movements of the userspeaking using the mobile device.
 5. The method of claim 1, furthercomprising: detecting, via the proximity sensor, the device orientationbeing indicative of a video-mode operating orientation; and switching,in response to the video-mode operating orientation detected, the mobiledevice from the audio mode to the video mode to place the mobile devicein videoconference operation.
 6. The method of claim 5, whereinswitching the mobile device from the audio mode to the video modecomprises: enabling the camera associated with the mobile device; andgenerating a video image on the display associated with the mobiledevice, the video image being generated based on the input received fromthe camera and configured to present a live image of a user using themobile device in the videoconference operation.
 7. A mobile deviceoperated by a user, comprising: a detection mechanism configured todetect a position of the mobile device; a camera; a display configuredto present an image representing the user; and a processor configured toinitialize one of a plurality of operation modes associated with themobile device based on the position detected by the detection mechanism.8. The mobile device of claim 7, wherein the plurality of operationmodes comprises a video mode and an audio mode, the video mode settingthe device in videoconference operation and the audio mode setting thedevice in teleconference operation.
 9. The mobile device of claim 8,wherein the processor is configured to alternate between initializingthe video mode and initializing the audio mode in response to theposition detected.
 10. The mobile device of claim 7, wherein thedetection mechanism is configured to detect the position by determiningproximity of the mobile device relative to the user or by determining anorientation of the mobile device around a specific axis.
 11. The mobiledevice of claim 10, wherein the detection mechanism comprises aproximity sensor for detecting the position of the mobile device. 12.The mobile device of claim 7, wherein the detection mechanism isconfigured to detect the position if the mobile device is in theposition for a predetermined time.
 13. The mobile device of claim 8,wherein the processor is further configured to: disable the camera andto generate an avatar as the image on the display in response to theaudio mode being initialized, and enable the camera and to show thecamera output as the image on the display in response to the video modebeing initialized.
 14. The mobile device of claim 8 further comprising:a first speaker configured to be used in proximity to ears of the userand a second speaker as a speakerphone, wherein the processor is furtherconfigured to route the sound transmission to the first speaker inresponse to the audio mode being initialized, and to the second speakerin response to the video mode being initialized.
 15. A computer-readablestorage medium storing computer-readable instructions, comprising:instructions for detecting an operating orientation of a user device,the operating orientation indicative of an orientation of the userdevice with reference to a user operating the user device; andinstructions for automatically switching an operation mode associatedwith the user device based on the operating orientation detected,wherein the switching includes activating a first set of functionalitiesof the user device and deactivating a second set of functionalities. 16.The computer-readable storage medium of claim 15, wherein the operatingorientation comprises a videoconference operating orientation and ateleconference operating orientation.
 17. The computer-readable storagemedium of claim 16, wherein the instructions for automatically switchingthe operation mode include: instructions for switching the user devicefrom a teleconference mode to videoconference mode in an event theoperating orientation is detected as the videoconference operatingorientation, and instructions for activating a camera associated withthe user device and deactivating a speaker that is configured to be usedin proximity to an ear of the user.
 18. The computer-readable storagemedium of claim 16, wherein the instructions for automatically switchingthe operation mode include: instructions for switching the user devicefrom a videoconference mode to a teleconference mode in an event theoperating orientation is detected as the teleconference operatingorientation, and instructions for deactivating a camera associated withthe user device and activating a speaker that is configured to be usedin proximity to an ear of the user.
 19. The computer-readable storagemedium of claim 15, wherein the instructions for detecting the operatingorientation include instructions for detecting the operating orientationusing a proximity sensor.
 20. The computer-readable storage medium ofclaim 19, wherein the instructions for detecting the operatingorientation include instructions for detecting, using the proximitysensor, proximity of the user device relative to an ear of the user. 21.The computer-readable storage medium of claim 19, wherein theinstructions for detecting the operating orientation includeinstructions for detecting the operating orientation by incorporating atime delay with the proximity sensor, wherein the proximity sensor istriggered after the time delay has passed.
 22. The computer-readablestorage medium of claim 20, wherein automatically switching theoperation mode is based on the proximity of the user device beingproximate to the ear of the user.